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Mathematicians just love sigma notation for two reasons. First, it provides a convenient 
way to express a long or even infinite series. But even more important, it looks really 
cool and scary, which frightens nonmathematicians into revering mathematicians and 
paying them more money. 
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§1 Introduction 


This is a handout about how to deal with complicated sums. Broadly, sums on olympiad 
contests fall into a few different categories: 


e Purely algebraic sums, like )7,, Ae a which are mostly exercise in algebraic 
manipulation. These are the kind you would get in a lecture named “sums and 
products”. 


e Sums which are combinatorial, involving expressions like aR You would see this 


in lectures named “combinatorial sums” and “counting in two ways”, or perhaps 
“eenerating functions” . 


e Sums which are number theoretic, involving functions like y, 4, or which involve 
expressions like 5> d|n> OF which ask you to take a sum modulo p. Often appears in 
“multiplicative number theory”. 


I think it is preferable to avoid separating the learning of these apparently disjoint topics, 
hence the unified handout. There are some connected themes which underlie all three; in 
particular, one of the big ideas which runs through the entire handout is the concept of 
swapping the order of summation. 


Theorem 1.1 (Swapping the order of summation) 
Let f(a,b) be a function. Then 


Eyreyys 


ac€AbeB beBacA 


This seemingly obvious fact is a nontrivial step in more than 50% of summation problems 
(in the same way that cyclic quadrilaterals is used in more than 50% of geometry problems). 
We will see this key idea again and again in the examples that follow. Thus any time 
you see a double sum, you should always consider computing the sum in the reversed 
order. In fact, even if you only have a single }> you should consider rewriting the 
expression as a double sum (e.g. a generating function) so that the above theorem applies. 
You might also need to change the variables before this step; for example we also have 

DI ae oe: 

a>0 b>0 k>0 a,b 

a+b=k 


§2 Algebraic manipulation 


These are the most low-tech problems, and often appear on short-answer contests as a 
result. These techniques apply to all sums, and especially to those which don’t have N or 
C flavor. 

Examples of things you can do: 


e Look at telescoping sequences. The classic example that everyone knows of is 


< n(n + 1) 
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e Generalizing the above point, looking at partial fractions is very often a good 
thing to do if your expression has polynomial denominators. 


e Try to factor the expression. In particular, one can factor through double sums: 


yrevrn= (Ke) (Leen), 


ac€AbeB acA be B 


e Try swapping the order of summation. (This is so important we’re saying it 
again.) 


§2.1 Telescoping and partial fractions 


I won’t provide too many examples here (though there are plenty in the practice problems!) 
since this is a topic that often appears at the AIME level anyways. So here is my token 
example. 


Example 2.1 (Stanford 2011) 


Evaluate the sum 


Ems Oy 


nol 


Solution. Again, since we have a polynomial denominator, our first reflex is to decompose 
its partial fractions. This gives us 


™m+32 16 9 
n(n+2) n n+2° 


Suddenly we’re done, because the sum telescopes according to the fact that 9 = (3/4)? -16: 
(3)’ 16 (3) 
1 n \4 n+2\4 


We see oe ar ioaaee in the sum tend to zero as n — oo. So the answer is 
x. 3 34 16 —12+2 = 3. 
is = 2 = 


§2.2 Swapping the order of summation 


For a less contrived example, we now prove linearity of expectation, which is the key 
result from [2]. 
The motivating example which I always present is that this allows one to compute the 


expected number of fixed points in a random permutation of {1,...,7}: 
Example 2.2 
A random permutation of {1,...,n} has an average of one fixed point. 


To see this, let’s look at the case n = 4: 
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Ww X Y Zid 
1|;W X Y Z| 4 
2|W X Z Yi} 2 
3/W Y X Zi] 2 
4;/W Y ZG X}1 
5|/W 24 X Yi 1 
6|;/W 24 Y X} 2 
7| xX W Y Zi] 2 
8| xX W Z Y!|]0 
9}; xX Y W Zi 

10; X Y Z W!]O0 
11| X Z W Y!]0 
12, > xX Z Y Wi 
13) Y W X Zi 
14)/Y WwW Z X/0 
15| Y xX W Zi} 2 
16>Y X Z WI 1 
17| Y Z W X/]0 
18) Y Z X W!]O0 
19| Z W X Y!]0 
20; Z2 W Y Xi 
21; Z2 xX W Yj1 
22; Z2 X Y Wy 2 
23}; Z Y W Xi} 0 
24; Z2 Y X WI 0 
x) 6 6 6) 6 | 24 


The idea is as follows: nominally, to compute the expected value, one is supposed to 
compute the number of fixed points in each row, and then take the average. However, 
the number of fixed points per row is chaotic. It would be much simpler to take the sum 
in each column, and take the average of those instead: in which case we see the answer is 
An: (n—1)!= 

The following theorem generalizes this: 


Example 2.3 (Linearity of Expectation) 


Let X and Y be random variables (not necessarily independent). Then ELX + Y] = 
ELX] + E[Y]. 


As described in [2] the proof of this is just a “double summation”. We do this formally 
here: 


Proof. For simplicity we do the case where X, Y take nonnegative integer values; for the 
general case one should use f rather than }°. By definition, we have 


X+¥)=\ nPX+Yenl=>) SO nPX=a,Y =d). 


n=0 n=0 a+b=n 


Unfortunately! we can’t write P(X =a,Y = b) = P(X =a)P(Y =b) since X and Y 
need not be independent. So instead we write n = a + b and then replace }7,39 )vato=n 


‘This was wrong in an earlier published version. Thanks to SH for noticing. 
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with )> ap This lets us split the sum into two parts, which we sum in two different orders. 
Here is the calculation: 


EIX+Y]=S° So (@+d)P(X =a,Y =d) 


n>0 a+b=n 
= 5° (a+b)P(X =a,Y =)) 
a,b>0 
=S°SoaP(X =a,Y =b) +5) So dP(Y =), X =a) 
a>0 b>0 b>0 a>0 
=) \aP(X =a)+>_ oP(Y =b) 
a>0 b>0 
= E[X]+E[Y]. 


Thus we see that: 


Every application of linearity of expectation is philosophically an 
application of switching the order of summation. 


Finally, let’s look at some number-theoretic examples. First, we prove the following 
lemma which we will revisit later. 


Lemma 2.4 (y « 1 = id) 
Let n > 1 be an integer. Then 


> 9(d) =n. 


d\n 


Proof. We will give a more pedestrian proof later on in “multiplicative number theory”, 
and so for now let’s present the nice combinatorial proof: look at the fractions 


nm 
nm 


Suppose we reduce all the fractions to simplest form. Then for every d | n, there are 
exactly y(d) fractions with denominator d. (If you don’t see why, try writing out the 
case n = 15.) Since there are n fractions total, we’re done. 


Now I can present my favorite example which illustrates just how useful swapping the 
order of summation can be. 


Example 2.5 (AMSP 2011 NT3 Exam) 


Let n be a positive integer. Prove that 


YS" ok) = | = s(n + ile 


k>1 


Proof. The key idea is to rewrite the floor as a sum involving divisors: 


9®) [EF] = le S31= OY ve). 
k>1 k>1 


m<n k>1m<n 
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Thus we’re computing the sum of y(k) over several pairs of integers (k,m) for which 
k|m,m <n. For example, if n = 6, the possible pairs (k,m) are given be the following 


table: 
(1,7). (1,2) (1,8) (i;4) (1,5) (1,6) 


(2,2) aa (2, 4) ° 
3,3 3,6 
(k,m) € (4,4) 
(5,5) 
(6,6) 


Nominally, we’re supposed to be summing by the rows of this table (i.e. fix & and run 
the sum over corresponding m). However, by interchanging the order of summation we 
can instead consider this as a sum over the columns: if we instead pick the value of m 


first, we see that 
S> SE ok) = 42S elk) 


k>1 k|m m=1 k|m 
m<n 


But we know how to evaluate the inner sum by the lemma! We get 


Y2 Tvl) = m= jn 


m=1 k|m 


§2.3 Finite Fourier analysis 


A classic MathCounts-flavored problem goes as follows: we wish to compute 


X (a) 


k>0 
(N) 


- 
1000 “> Galen 


Adding these together, we get that twice the desired sum is 2!9°° hence the answer is 
2999 The key fact we used was that 


In order to do this, we use the fact that 


1+ 1) 1000 


1"+(-1)" Jl neven 
2 ~ |0 nodd. 


This trick can be generalized using a so-called roots of unity filter. To motivate it, 
we consider the following example problem. 


Example 2.6 (Classical application of roots of unity filter) 


ey, 


k>0 


Compute 
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Solution. We can rewrite the sum as 


where 


f(n) = f n=0 (mod 3) 


0 otherwise. 


So we want the mod 3 analog of the parity detector 1” + (—1)” we had earlier. 
The trick, which gives it the name “roots of unity filter”, is that we can take 


fla) = ; (i? + w” + w?") 


where w = exp ($71) is a cube root of unity, satisfying the relation w? +w+1=0. Thus, 


> (”) = >» (y) (1+ 0" +w"), 


n>0 n>0 


we have 


We can swap the order of summation now (never gets old, does it?) and instead consider 


5 (Y= (35 (meh 


n=0 n=0 n=0 


(This doesn’t look like the previous examples of swapping the order of summation, but 
perhaps I could instead write >7,,5.9 SS ee ant ae At Ge, The point 
is we are exchanging an 1000 x 3 sum with a 3 x 1000 sum.) Thus by the binomial 


theorem the expression in question is 


1000 L 
S- ( ) £00 = ; (1 a aes afl (1 sao) si (1 + w?) 1000) 
n>0 " 
_ : [g1000 + (—w?) 1000 4 (ws) 2000} 
1. 
=5 Pe or “ti? 
= in [91000 _ 1] 
3 | . 
§2.4 Problems 
Problem 2.7 (Putnam 2011). Let a1,a2,... and bj), b2,... be sequences of positive real 
numbers such that a, = b) = 1 and b, = bn_ 1a, — 2 for n = 2,3,.... Assume that the 


sequence (b;) is bounded. Prove that 


= 1 
$= 
mEL ay or os An 


converges, and evaluate S. 


Problem 2.8 (Putnam 2013). Let ao,a1,...,@n,z be real numbers, where 0 < x < 1, 
satisfying 
ao ay an 
oe =f 
ie ie 1—_grt 


Prove that for some 0 < y < 1 we have 


ag + ary + agy? +--+ + any” = 0. 
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Problem 2.9 (Putnam 2015). Let T be the set of triples of positive integers whose 
lengths are the sides of a triangle. Compute 
oe 
bec’ 
(a,b,c)ET 35 


Problem 2.10. How many nonempty subsets of {1,2,..., 1000} have sum divisible by 
3? 


§3 Sums modulo a prime 


Sometimes you have a sum which you’d like to evaluate modulo p instead of computing 
altogether. Most of the same strategies from before still apply, and so these problems are 
not actually too different from before. 

However, when working in F,, instead of R, you can take advantage of the fact that there 
are only finitely many elements to get some useful symmetry. A simple manifestation of 
this is 


Lemma 3.1 (Fermat's little theorem) 


Let p be a prime. Then a?~! = 1 (mod p) whenever gcd(a,p) = 1. 


Proof. Multiplication by a exhibits a bijection from {1,...,p— 1} to itself. Now take the 
products to get a?~!(p — 1)! = (p—1)! (mod p), and cancel the resulting (p — 1)!. 


I’m sure you’ve already seen this lemma before, but I included the proof anyways because 
its main idea of exploiting the fact that F, is finite is often useful. 

Similarly, a sum or product from 1 to p— 1 will often cancel out in spectacular ways, 
such as: 


Lemma 3.2 (Wilson's theorem) 


For any prime p, 
(p—1)!=—-1 (mod p). 


Exercise 3.3. Prove this theorem if you don’t already know how. 


The next lemma is somewhat less well-known than it deserves to be. 


Lemma 3.4 (Sums of powers modulo p) 


Let p be a prime and m an integer. Then 


0 (mod p) ifp—l{tm 
—1 (mod p) ifp—1|m. 


EI tod] 


Proof. Suffices to show the case p— 14m since the other is obvious. Let g be a primitive 
root modulo p (see [1]), then the above sum equals 


ge2-lm = 1 


1 m wee (p—2)m — 
rg retrg gm 1 


(mod p) 


which is okay since the denominator isn’t zero. But the numerator vanishes. 
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Here is another example. 


Example 3.5 (Wolstenholme’s theorem) 
Let p > 3 be a prime. Then 


a 1 
(+5 ++) =0 (mod p’). 


Note the presence of (p — 1)! is just a formality to ensure the left-hand side is an integer; 
it makes more sense to just evaluate S = 17-14 2-!4---+(p—1)~+ (mod p?). 


Proof. Let S = 1~1+27-14---+(p—1)! (mod p?). It’s already clear (by say Lemma 3.4 
with m = —1) that p | S, but we in fact want p? | S. We exploit the symmetry by pairing 
up the opposite terms: 


a Sits (=a k)’ 


So just as before we see that S' is divisible by p, but this time we know information mod 
p*: it now suffices to show that 


p-1 1 
poe > k(p — k) 
is zero modulo p as well. But 
p-1 1 poly p-l 2 
Do p= = RH =~ DF (mod) 


k=1 k=1 k=1 


and we’re done by Lemma 3.4 applied at m = —2. (Where did we use p > 3?) 


Finally, here is a weird trick which is often useful when faced with i expressions modulo 
p. Look through the problems for a few examples. 


Lemma 3.6 (Harmonic modulo p trick) 


For any integer k = 1,2,...,p—1, we have 


Exercise 3.7. Check this. 


This is often useful because it reduces a computation involving k~!’s modulo p to a 
computation involving (2) mod p?; the latter is often easier to deal with. 


§3.1 Problems 


Problem 3.8 (ELMO 2009, John Berman). Let p be an odd prime and zx be an integer 
such that p| 2? — 1 but p{a«—1. Prove that p divides 


wp Pl 
_—1) sae ; 
(p— 1) (< oe =) 
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Problem 3.9. Let p > 5 be a prime. Prove that 
1 1 1 
pss =0 dp’). 
3 + 53 "Gas (mod p*) 


Problem 3.10 (OMO 2013 W42, Victor Wang). Find the remainder when 


100 


][a-?+ 


i=0 


is divided by 101. 


84 Multiplicative number theory 


Earlier we saw that that Dodi y(d) = n. In this section we’ll develop a more general 
theory which can handle these divisor-based sums. 

To get a grasp of what we’re doing, let’s redo the example from above. The key 
idea is rooted in the standard MathCounts formula for the sum of the prime factors of 
m= pi... pi: it is equal to 


So d= (ltpitpi+:::+p%') (L+potpot---+p52)...(L+pet+ pe +--+ + 7,")- 
d\n 


Indeed, if we expand the right-hand side, each factor appears exactly once. 
But in fact, y has a special property: if m and n are relatively prime, then y(mn) = 
y(m)y(n). (Proof: Chinese remainder theorem.) So we can copy the work above to get 


> (4) = (PL) + or) ++ + PPP) --- (PCL) + (pe) +++ + PCR)) - 
d|n 


But now we’re done, because it’s easy to see that y(1) + y(p) +--- + y(p®) = p® when p 
is a prime, and thus the right-hand side is n. 

In this short section we will grow this idea into its full generality. In what follows, N 
denotes the positive integers. 
§4.1 Multiplicative functions 
Definition 4.1. An arithmetic function is a function f : N—> C. It is 


e multiplicative if f(mn) = f(m)f(n) for relatively prime m and n. 


e completely multiplicative if f(mn) = f(m)f(n) for any m and n. 


Example 4.2 (Completely multiplicative functions) 


The following are examples of completely multiplicative functions. 
e The identity function id. 


all 
SW, 


1 
e The Dirichlet delta function, 5(n) = ‘i 


e The constant function 1, given by 1(n) = 1. 


10 
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Example 4.3 (Multiplicative functions) 


The following are examples of multiplicative functions which are not completely 
multiplicative. 


e Euler’s y. For example, y(15) = y(3)y(5). But not y(9) = v(3)y(3). 


e The Mobius function p, defined by 


(-—1)™ if n has m prime factors, all distinct 
p(n) = 


0 n is not squarefree. 
For example, 4(10) = 1, (105) = —1, but w(12) = 0. 
e a is the sum of divisors function. For example o(6) = 1+2+3+6= 12. 


e 7 is the divisor counting function. For example 7(6) = 4. 


Of course, the product of two multiplicative functions is also multiplicative. 
The nice property of multiplicative functions is that it is sufficient to determine 
their values on prime powers. For example, we have formulas like 


a4. | ma a 
y(n) = (v3 — pt ) (vo? — p3 ) wae) 
when n = pj’... ps. This is one way you can prove the formula above: the Chinese 
remainder theorem implies quickly that y is multiplicative, and it is straightforward to 
compute y(p”) = p* — p*-1, hence the result. 


84.2 Dirichlet convolution 


Now given two arithmetic functions f and g, we define the Dirichlet convolution as 


(f*9)(n) = 0 f(dg(n/d) = D> F(dg(e). 


d\n de=n 


This is more frequent than you might expect: 


Example 4.4 (Examples of Convolution) 
Verify that 


eilxl=rT. 

e lxid=a. 

e idxid=n-T. 

e 1x y = id (previous example). 


e 6x f =f for any f. 


Here are some properties of *: 
e The identity of * is the Dirichlet delta function. 


e Clearly * is commutative. 


11 
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e The operation * is associative because 
((f #9) *h)(n) =(f*(g*h))(n)= DS) f(di)g(de)h(ds). 
didgd3=n 


e It distributes over addition: f*x(g+h)=fxg+fxh. 


e Most important: the convolution of two multiplicative functions is also 
multiplicative. 


Thus we have an operation on the set of multiplicative functions. 


Exercise 4.5. Check that convolutions of multiplicative functions are multiplicative. 
(Mimic the proof of ~ we used earlier.) 


The upshot of this is that we can now use our theory to completely eradicate our 
earlier example. 


Example 4.6 (vy «1 = id) 
Let n > 1 be an integer. Then 


> 9d) = 


d\n 


Proof. Rephrasing the problem, we wish to show that 
yx*1 = id. 


It is true for prime powers n = p®, because the left-hand side is 1 + (p — 1) + (p? — p) + 
-- + (p® — p®!) = p®. But since both the LHS and RHS are multiplicative, this implies 
the problem. 


§4.3 Mobius inversion 


We now know that given a multiplicative function f, we have a good handle on 
= f@ 
d\n 


because g is the Dirichlet convolution f « 1. However, occasionally we instead have g 
and want to recover f. The way to do this is through Mobius inversion. 


Lemma 4.7 (Mobius is inverse of 1) 


We have w* 1 = 0. 


Exercise 4.8. Prove this. (Check on prime powers.) 


Theorem 4.9 (Mobius inversion formula) 


Let f and g be any arithmetic functions (possibly not multiplicative). Then 


=DI@ = flr) =) w@a(n/d). 


d|n d\n 


In other words, if g = f x1 then f=g* uw. 


12 
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Proof. Assume g = f x1. Then 


g*ma=(frl)xuafe(lep)a=freo=f. 


Here is a silly application from the IMO Shortlist. 


Example 4.10 (Shortlist 1989) 
Define a sequence (an)n>1 by >> din d = 2”. Show that n divides ap. 


Solution. By Mobius inversion, an = ale (n/d)2¢. Now just let n = p{!...p& and 
bash. Details omitted. 


§4.4 Problems 
Problem 4.11. Prove that for any integer n > 1, 


2 


d|n d|n 


Problem 4.12 (Bulgaria 1989). Let Q(n) denote the number of prime factors of n, 
counted with multiplicity. Evaluate 


1989 
1989 
4 )X(n) | a8 OF 
yon =|. 


Problem 4.13. Prove that for all positive integers n, 


u(n)= S2 cos (=) 


1<k<n 
gcd(k,n)=1 


§5 Generating functions 


A generating function is a device somewhat similar to a bag. Instead of 
carrying many little objects detachedly, which could be embarrassing, we put 
them all in a bag, and then we have only one object to carry, the bag. 

— George Pélya 


The idea of generating functions is as follows: given a sequence a, we want to understand 
we consider the formal series 
A(z) = SS oe 


n>0 
This gives us another sum that we can work with. Reasons why this could be helpful: 
e A(x) may often have a nice closed form, like 1 +2 +a?+---= — 
has said this makes it much easier to work with. 


and as Pélya 


e Often a, is itself a sum, and now we can change the order of summation. See Snake 
Oil below. 


13 
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§5.1 Toy applications 


Example 5.1 (The Classic) 
We have 


yy (Oy) _ 91000 
n 


n>=0 


Proof. Let an = ene for example. Of course, by the binomial theorem we have attached 


generating function 


A(t) = SO ant" = 7 (”) 2” = (1 +a), 


n>=0 n>0 


(1000) — 91000 


Plugging in « = 1, for example, then implies the usual identity }7,., 


Example 5.2 (The Derivative of the Classic) 


We have ar 
En ) = 1000 - 2999. 
nr 


n>=0 


Proof. Take the derivative of the previous example 


ou nz"! = 1000(1 + x). 


nr 
n>1 


This time, the we obtain }),59 mio) = 1000<2°?". 


ae 


Exercise 5.3. Compute 7,59 od ae 


§5.2 Linear recurrences 


Let’s now use generating functions to derive the closed form of the Lucas numbers Ly, 
which are defined by 


Io =2, L,=1, LInpo=LPnit ttn. 


For concreteness, the first few terms are 2,1,3,4,7, 11,18, 29, 47, 76,.... 


Theorem 5.4 (Explicit form for Lucas numbers) 
Let a = $(1+ V5) and B = $(1— V5). Then 


Ln =a” + B”. 


Proof. We consider the generating function 


14 
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We aim to find its compact form. Write 
L(a) = 2+a¢+307 + 42° + 724+... 
aL (x) = 224+ 274327 +4e4+... 
a’ L(x) = 20° +4+2°43a44+.... 


Thus we can deduce 
L(x) — #&L(2) — 27 L(x) = 2-2 
and consequently 
2=2 
2 _ = 


Now, what can we do with this? Answer: use partial fractions again. The denominator 
factors into (1 — ax)(1— 8x), and from this we deduce 


ie) = » Lae” = j = — ‘> (a” + B”) a”. 


+ 
1— 
n>=0 oe Bax n>0 


So we’re done upon equating coefficients. 


Philosophically, what’s happened is that by considering a sum, we could compactify all 
the information about L, into a single rational expression, which we could then use 
partial fractions on. 


Exercise 5.5. In the same way, derive Binet’s formula: if F, is the nth Fibonacci 


number then i 

F, = —(a” — B"). 
As an intermediate step, you should find the generating function ;—"—> for the Fibonacci 
numbers. 


In principle, every linear recurrence can be solved in this way. For more details see for 


example [5]. 


§5.3 Common generating functions 


So far, we know that 74. =1+a+2?4+... and (1+2)"=14+(j)a+ ($)2?4+.... It 
turns out we can get more formulas to add to our arsenal using the following: 


Theorem 5.6 (Generalized binomial theorem) 


Let r be any real number (not necessarily an integer). Then 


ater = 3 (")o ee (See 


n>=0 


Proof. For concreteness, let’s show the coefficient of x? is ee Suppose 


(l+a)"= Saja" = ap + a1e + agx? + age? + age*+.... 
k>0 
Take the triple derivative: 


r(r —1)(r —2)- (14+ 2)"~? = 6a3 + 24a4x + 60a5z7 +... 


Now if we compare the constant terms we get a3 = ér(r —1)(r— 2). 
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This generalized binomial theorem has several applications, described below. 


e For example, if we set r = —1 we obtain 
1 =D) ge KE . 
(eee 
n>0 n>0 n>0 


e More generally: 
Exercise 5.7. Prove that for an integer m, we have ) = (-1)* areas Deduce 


k 
that 
1 _ (m\ 6 m+i1\ , m+2\ » _ Kea) 
ue see aa ew, 
k>0 
e A second interesting example is r = —}. For convenience, we will look at (1— d4a)~2 


rather than (1 — x)72, Note that 


(1 —42)~2 = S-(-4)" -) ok 


kl . 
k>0 
ee (2k — 1) (2k — 1)! 
k k k k 
2 kl oe ko 
k>0 k>0 
_ (2k)!/k! 2h \ 
=> CUE =o (7 ot 
k>0 k>0 
Thus we get the generating function for eae Here we have used the well-known 


fact that 
2*kI(2k — 1)! = (2k)! 


If you haven’t seen this before, try to prove it! 
e Finally, let’s integrate both sides of (1 — Aa)~2 = Fin a) z*, This tells us that 


1 FON gg 1/4 1 


k>0 


for some constant C; by comparing the constant terms, we get C = 5: Dividing by 
x we derive 
3 1 2k k 1- V 1- Ag 
—— 2 = —_——_.. 
k+1\k 2x 
k>0 


The numbers C;, = pal?) are the infamous Catalan numbers. 


In Table 1 we collect the fruits of our labors above. Notice that in particular we can 
handle both of 


e sequences of the form (") (where the top is fixed while the bottom moves) 


* 


e sequences of the form (* 


) (where the bottom is fixed while the top moves). 


These formulas will be indispensable later on. 
Here is an application of the formulas. 
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ata 3 (Ge wee hi) 
aap - L(y") ee 
ae x ()* “tae 


Table 1: Table of common generating functions 


Example 5.8 (HMMT 2007 Combinatorics #9) 
Let S be the set of triples (i,j,k) of positive integers which satisfy i+ 7 + k = 17. 


Compute 
ie 
(i,5,k)ES 


Solution. The point is to notice that one can consider the quantity 
F(a@\= S> ia’ ye > kak 
7>0 j20 k>0 
and that we merely want the coefficient of x!”. But 


so we need the x!“ coefficient of aay which we know is ese 


§5.4 Snake Oil method 


The so-called Snake Oil method, so dubbed by [4], is a powerful way to force a combi- 
natorial sum into a generating-function double sum. It serves as a “miracle cure” for a 
whole class of problems, hence the name. 

Here’s how it works: suppose we have a sum we want to evaluate in terms of n, perhaps 


of the form 
a= S° F (k,n) 
k 


for some function F’. It would suffice to evaluate the generating function 
A(z) = So ana” =SOS oF lk, n)a” 
n>0 n>=0 k 
Since this is a double sum, we can now swap the order of summation and obtain 
“YP 
n=0 


With any luck the interchanged sum will be significantly easier to evaluate. This works 
best if F' is the product of several functions that we know the generating functions of 
(like in table above). 

Let’s see an example: 
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Example 5.9 ((4]) 


For n > 0, compute 


Proof. As described above, we let 


A(z) => ae o=> ("3 ante 


n>0 | k>0 k>0 n>0 
_ m+ k ~ ~ (n k) + 2k fk 
ye ere 
k>0 n>0 k>0 = n>0 
2k+1 
_ k a+2k é- k 1 
a > ( 2k ) ex) “Za (5 
k>0 a>0 k>0 
23 1 ( x oi 1 | 1-22 
i 1 2e (1 — 2x)? 1-21-G35 (1-22)? -2 
1=9 1 — 22 1/3 2/3 
To a e 


~ 15a +422 (l—42)\1—-a2) 1l-z 1-42 


Example 5.10 ((4]) 


For n > 0, compute 


Solution. Snake Oil: 


n>0 | k>0 k>0 n>0 
k 

= k n—k 
a D 

k>0  k<n<2k 

1 
k k 

=o) 

0 1—a(14+2) 
_ 1 

l—a— <x? 


Thus the sum in question is the Fibonacci number F;,+1. 


The snake oil method relies on having the free variable appearing in only one place. 
So the following example doesn’t succumb immediately to Snake Oil: 
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Prove that for n > 0 we have 
3 n 2n _ (3n 
—\i} \n-i Nay 


However, it is a special case of Vandermonde convolution, which does succumb to Snake 
Oil. So if a free variable appears in many places, this is often a hint to try and consider 
generalizations of the identity. 


§5.5 Problems 
Problem 5.11 ([3]). Prove that for every integer n > 0, 
2a\ (2b 
=A", 
x ()G) 
a+b=n 
Problem 5.12 ([3]). For integers m,n > 0, prove that 
Ss" yr(") as ') _ f m=—0 
= a 0 m>Q0. 
Problem 5.13 (([6]). Let m <n be positive integers. Compute 


> (2) Gn) 


k=m 


Problem 5.14 ([4]). For m,n > 1 compute 


MES es ean 


k>0 


86 Author’s Pick 


Problem 6.1 (AMSP 2011 NT3 Exam). Let be the Mobius function. For n > 1, 


evaluate . 
OIE 
k=1 


Problem 6.2 (USAMO 2010/5, Titu Andreescu). Let g = a where p is an odd prime, 


and let 
1 1 1 


S,= { fp... 4 . 
77 2-3-4 5-6-7 q(q + 1)(q+ 2) 


T T 
Prove that if 5 — 2S, = > for integers m and n, then m — n is divisible by p. 


Problem 6.3 (Princeton Individual Finals 2015, Xiaoyu Xu). Let p be an odd prime. 
Prove that p? | 2? — 2 if and only if 
1 1 1 


ae oF Gan @-Deop =" (mod p). 
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Problem 6.4 (Reed Dawson, [4]). For n > 0, compute 


EC) () (G4) 

a k; k 4 
Problem 6.5 (NIMO 24.8, Evan Chen). For a complex number z # 3,4, let F(z) denote 
the real part of Compute 


fe (eae) Ae 
0 


Problem 6.6 (NIMO 14.8, Evan Chen). Let x be a positive real number. Define 


Po oe 
(3-2)(4-z) * 


gbkt1 gpkt2 


Oo gk co 
A=) Gar a 3 GEE D! ae aera 


k=0 


Given that A? + B? + C? + 8ABC = 2014, compute ABC. 


Problem 6.7 (OMO 2013 F24, Evan Chen). The real numbers ao, a@1,...,@2013 and 
bo, by, eR 62013 satisfy 


1 1 
in = ag V2n + 24-a,-4 and b= og vent 2 — bn-1 


for every integer n = 1,2,..., 2013. If ag = b2913 and bo = a2913, compute 
2013 
>, (aybp_1 — @p—10g) - 
k=1 


Problem 6.8 (Shortlist 2014 N6). Let a1 < az <--- < dy be pairwise coprime positive 
integers with a, being prime and a; > n+ 2. On the segment I = (0, a,a2---a,] of the 
real line, mark all integers that are divisible by at least one of the numbers aj, ..., Gn. 
These points split J into a number of smaller segments. Prove that the sum of the squares 
of the lengths of these segments is divisible by ay. 


Problem 6.9 (OMO 2014 $25, Michael Kural). Compute 


Problem 6.10 (OMO 2015 F30, Michael Kural). Ryan is learning number theory. 
He reads about the Mobius function uw : N — Z, defined by u(1) = 1 and p(n) = 
-»>> dinaza u(d) for n > 1. However, Ryan doesn’t like negative numbers, so he invents 
his own function: the dubious function A : N > N, defined by the relations A(1) = 1 and 


Evan Chen (October 13, 2016) Summations 
§7 Hints 


2.7 Telescoping. S = 3/2. 


2.8 By contradiction and intermediate value theorem. Expand the geometric series and 
swap the order of summation. 


2.9 ae Ravi substitution. 
2.10 Roots of unity filter on —1+ (14+ 2)(1+ x?) (1 + x?) 1. (1+ Caen? 
3.8 Lemma 3.6. 


3.9 Repeat the proof of Wolstenholme by adding opposite terms. 


3.10 Answer is 9. Write as ot1 for 7 #4 +10. Pair up and cancel all the other remaining 
+1 
terms and cancel. 


4.11 Both sides are multiplicative, so check it on prime powers. 


4.12 You will need to compute >> ain(—1)°. Then swap the order of summation with 
the floor again. 


4.13 Let F'(n) be the right-hand side. Show that F «1 =o. 


1 
5.11 Square of Fa: 


5.12 Product of (1+ (—ax))” and oP 
5.13 Snake Oil. (”")2"-™ 
5.14 Snake Oil. ("~4). 


6.1 It equals 1. Switch order of summation with floor as before. Use the fact that 
dain H(d) = 6(n). 

6.2 Partial fractions. 

6.3 Partial fractions, plus Lemma 3.6. 


6.4 2-2"(7"). Snake Oil. 


6.5 Don’t be scared by the integral. Partial fractions then geometric series. Switch the 
sum and integral. 


6.6 Answer is 183. Show that A? + B? + C? — 3ABC = 1. Roots of unity filter with e”. 
6.7 Look at (a@n — Gn—1)(bn + bn—1). 


6.8 Don’t be scared that this is an N6. Let p = a, and look at intervals of the form 
[cp,cp + p|. Break the sum into several organized parts, and show that each part has 
sum zero. You will likely need Lemma 3.4. 
6.9 wr Swap the order of summation, and telescope the inverted 4) binomial 
coefficients into Coa) Then telescope again. 
6.10 Answer: a. Let f(i,7) = A(3'5/). Find a recurrence for f, and deduce that the 


corresponding two-variable generating function obeys F(x, y) = 5 (1 | Ty 7%): 


Compute the s° coefficient of F(s, i). 
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